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The severe acute respiratory syndrome coronavirus (SARS-CoV) possesses a large 29.7-kb positive-stranded 
RNA genome. The first open reading frame encodes replicase polyproteins la and lab, which are cleaved to 
generate 16 “nonstructural” proteins, nspl to nspl6, involved in viral replication and/or RNA processing. 

Among these, nsplO plays a critical role in minus-strand RNA synthesis in a related coronavirus, murine 
hepatitis virus. Here, we report the crystal structure of SARS-CoV nsplO at a resolution of 1.8 A as determined 
by single-wavelength anomalous dispersion using phases derived from hexatantalum dodecabromide. nsplO is 
a single domain protein consisting of a pair of antiparallel N-terminal helices stacked against an irregular 
(3-sheet, a coil-rich C terminus, and two Zn fingers. nsplO represents a novel fold and is the first structural 
representative of this family of Zn finger proteins found so far exclusively in coronaviruses. The first Zn finger 
coordinates a Zn 2+ ion in a unique conformation. The second Zn finger, with four cysteines, is a distant 
member of the “gag-knuckle fold group” of Zn 2+ -binding domains and appears to maintain the structural 
integrity of the C-terminal tail. A distinct clustering of basic residues on the protein surface suggests a nucleic 
acid-binding function. Gel shift assays indicate that in isolation, nsplO binds single- and double-stranded RNA 
and DNA with high-micromolar affinity and without obvious sequence specificity. It is possible that nsplO 
functions within a larger RNA-binding protein complex. However, its exact role within the replicase complex 
is still not clear. 


The severe acute respiratory syndrome coronavirus (SARS- 
CoV) is a positive-stranded RNA virus with a large 29.7-kb 
genome that encodes 14 open reading frames (ORFs) (26, 30). 
The first of these ORFs covers two-thirds of the genome and 
codes for the replicase polyproteins la and lab (ppla and 
pplab) (30). pplab is formed by frame shifting of the ribosome 
into the —1 frame during translation, just prior to the ppla 
stop codon (2, 30). The two polyproteins are cleaved into 16 
“nonstructural” proteins, nspl to nspl6, by two viral cysteine 
proteases, a 3CL-like protease and a papain-like protease (10, 
21, 22). These proteolytically generated proteins and/or their 
cleavage intermediates are involved in viral replication and/or 
generation of the nested subgenomic mRNAs required for 
expression of the downstream ORFs within the host cell (32). 
The remaining ORFs can be categorized into the structural 
proteins (those that are part of the virion) and the “accessory” 
proteins. Immunofluorescence microscopy studies in coronavi¬ 
ruses such as murine hepatitis virus (MHV) reveal that some, 
if not all, of the nsp proteins, including pl5 (counterpart to 
SARS-CoV nsplO), assemble into distinct cytoplasmic, mem¬ 
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brane-associated replicase complexes that actively perform vi¬ 
ral RNA synthesis (21, 24). 

We have recently undertaken an initiative to generate a 
structure-function-interaction map of the entire proteome of 
SARS-CoV and its interactions with the host cell, presenting a 
comprehensive set of targets for therapeutic intervention (http: 
//sars.scripps.edu/). To this end, we are using a structural and 
functional proteomics approach employing bioinformatics, 
structural biology, genetic approaches, and macromolecular 
interaction studies. Our studies and those of other laboratories 
on the SARS-CoV replicase proteins are beginning to provide 
the first clues as to the complex replication, transcription, and 
RNA processing events unique to the life cycle of coronavi¬ 
ruses. We recently determined the nuclear magnetic resonance 
structure of nsp7 and showed that it adopts a novel fold con¬ 
sisting of five helices, three of them forming a sheet (20). 
Subsequently, Zhai et al. (31) determined the crystal structure 
of a supercomplex consisting of eight subunits each of nsp7 and 
nsp8. This hollow cylinder-like assembly allows the passage of 
RNA and likely forms the core framework of the replicase 
complex. The central channel of the cylinder has an internal 
diameter of ~30 A and is positively charged, favoring nucleic 
acid binding, implying a role in conferring processivity to the 
RNA-dependent RNA polymerase (31). It has been demon¬ 
strated that nsp8 interacts with nsp9 (27), and the structure of 
the latter has been independently determined by two groups 
(9, 27). nsp9 binds single-stranded RNA (ssRNA) and double- 
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FIG. 1. Sequence of SARS-CoV nsplO. Shown is a schematic of the SARS-CoV genome depicting the location of the replicase polyprotein and 
the different structural and accessory genes. The immediate neighborhood of nsplO within ppla/lab is expanded in the inset, under which a 
sequence alignment of different homologues of nsplO in coronaviruses is shown. RdRp, RNA-dependent RNA polymerase; TGEV, porcine 
transmissible gastroenteritis virus; IBV, avian infectious bronchitis virus; Hel, helicase; ExoN, exonuclease; Nendoll, endonuclease; 2'-o-MT, 
2'-o-methyltransferase; EDV, epidemic diarrhea virus. 


stranded DNA (dsDNA) without obvious sequence specificity, 
optimally binding 45-mers that likely wrap once around a 
dimer of the protein (9, 27). 

In this paper, we describe the structure of nsplO, a protein 
located in the SARS-CoV proteome just downstream of key 
replicase complex proteins: the nsp7-nsp8 supercomplex as 
well as the RNA-binding nsp9 on one end and the RNA poly¬ 
merase (nspl2) on the other (26). The intervening nspll is a 
short 13-residue peptide (Fig. 1). Homologues of nsplO are 
found in all coronaviruses (Fig. 1); nsplO is among the more 
conserved of the SARS-CoV nsp proteins. nsplO has been 
shown to play a critical role in the replicase complex, as a 
temperature-sensitive mutation of this protein in MHV pre¬ 
vented minus-strand RNA synthesis at the nonpermissive tem¬ 
perature (25). The structure of SARS-CoV nsplO was solved to 
a resolution of 1.8 A by single-wavelength anomalous disper¬ 
sion (SAD) on a hexatantalum dodecabromide (Ta 6 Br 12 2+ )- 
derivatized crystal. nsplO contains two Zn 2+ -binding sites, an 
N-terminal subdomain rich in helices, and a C-terminal do¬ 
main consisting of (3-strands and loops. The topological ar¬ 
rangement seen in this structure has not, to the best of our 
knowledge, been previously observed and can therefore be 


regarded as a new protein fold. The unique arrangement of 
residues that coordinate the first Zn 2+ ion (involving a histi¬ 
dine and three cysteines) defines a new class of Zn fingers, 
while the second Zn finger has features of the gag-knuckle fold 
group (14). The physical location of nsplO in the SARS-CoV 
proteome within a cluster of proteins which interact with RNA 
(26) as well as the occurrence of two Zn fingers and a well- 
defined positively charged patch on the protein surface hinted 
at a possible RNA-binding function. However, RNA-binding 
experiments indicate that while nsplO binds to both single- 
stranded and double-stranded DNA and RNA, it does so with 
only high-micromolar affinity. This does not rule out the pos¬ 
sibility that it may bind specifically and/or with greater affinity 
to nucleic acid as part of a larger complex that confers proces- 
sivity during replication of the genomic or subgenomic RNA of 
the virus. 

MATERIALS AND METHODS 

Construct design, cloning, expression, and purification. The predicted full- 
length sequence of nsplO (gi:29837503; NP_828868) extends from nucleotides 
12955 to 13371 of the SARS-CoV genome, corresponding to residues Ala4231 to 
Gln4369 of ppla/pplab (26). A construct corresponding to residues 10 to 139 of 
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nsplO (truncating the first nine residues predicted to be random coiled) was 
amplified by PCR from genomic cDNA of the SARS-CoV Tor-2 strain using Taq 
polymerase and primer pairs encoding the 5' and 3' ends (forward, 5'-ATGAA 
TTCAACTGTGCTTTCCTTCTGTGCTAGGGCCGGCCGGG-3'; reverse, 5'- 
CTGCATCAAGGGTTCGCGGACCCGGCCGGCCCTA-3'). The PCR prod¬ 
uct was cloned into the expression vector pMHIF, a derivative of pBAD 
(Invitrogen). Expression in pMHIF is driven by the araBAD promoter, and the 
recombinant protein is produced with an N-terminal Thio 6 His 6 tag (MGSDKI 
HHHHHH). 

A sequence-verified clone was transformed into the Escherichia coli methi¬ 
onine-auxotrophic strain DL41. An overnight culture from a fresh transformant 
was used to inoculate flasks of 2XYT-ampicillin media. The culture was grown at 
37°C with vigorous shaking to an optical density at 600 nm of 0.6 to 0.7, induced 
with 0.2% (wt/vol) L-arabinose, and grown at 14°C for 8 h. The cells were 
harvested by centrifugation and lysed by sonication in buffer containing 50 mM 
potassium phosphate, pH 7.8, 300 mM NaCl, 10% glycerol, 5 mM imidazole, 0.5 
mg/ml lysozyme, 100 jxl/liter benzonase, and EDTA-free protease inhibitor (one 
tablet per 50 ml buffer; Roche). The lysate was clarified by ultracentrifugation at 
100,000 X g for 45 min at 4°C, and the soluble fraction was applied onto a metal 
chelate column (Talon resin charged with cobalt; Clontech). The column was 
washed with 20 mM Tris, pH 7.8, 300 mM NaCl, 10% glycerol, 5 mM imidazole 
and eluted with 25 mM Tris, pH 7.8, 300 mM NaCl, 150 mM imidazole. The 
eluate was then purified by anion exchange on a Poros HQ column using a linear 
gradient of NaCl (0 to 1 M) in 25 Tris-HCl, pH 8.0, followed by size exclusion on 
a Superdex 75 column equilibrated with 10 mM Tris, pH 7.8,150 mM NaCl. Pure 
fractions of nsplO were concentrated to 3.5 mM and either used immediately for 
crystallization trials or flash frozen in liquid nitrogen for later use. 

Crystallization and data collection. nsplO was crystallized using the nanodro¬ 
plet vapor diffusion method (23). Drops (100 nl 3.5 mM protein plus 100 nl 
crystallant) were dispensed into 96-well low-profile Greiner plates using an 
Innovadyne liquid-handling robot. After 21 days at 4°C, harvestable cubic crystals 
(—50 [im by —50 |xm by —50 |xm) were obtained in 2.0 M ammonium formate, 
0.1 M HEPES, pH 7.5. A native 1.8- A data set (at a wavelength of 1.0 A) was 
collected on beamline 8.2.1 at the Advanced Light Source (Berkeley, CA). 
Native crystals were derivatized by being soaked for 3 days in a solution of the 
Ta 6 Br 12 2+ cluster compound (1 mM) in cryoprotectant (30% glycerol in reser¬ 
voir solution). The use of Ta 6 Br 12 2+ in the derivatization of crystals of large 
macromolecules has been described earlier (1). The cluster is a regular octahe¬ 
dron of six Ta atoms with 12 bridging Br atoms at the edges of the octahedron. 
Both Ta and Br atoms exhibit large anomalous dispersion signals at their ab¬ 
sorption edges of 9,918.74 eV (1.25 A) and 13,476.54 eV (0.92 A), respectively. 
Single-wavelength anomalous diffraction (SAD) data were collected to 2.11 A on 
a derivatized crystal at the same beam line at a wavelength of 1.2553 A, corre¬ 
sponding to the inflection point of the Ta L-III absorption edge. Reflections of 
both native and SAD datasets were indexed in I-centered cubic lattices (space 
group 123), integrated, and scaled using HKL2000 (19). The Laue group was 
later identified to be I2 X 3. 

Structure determination and refinement. Initial phases were obtained by SAD 
phasing with the 2.11 -A data set at the Ta inflection wavelength using the 
program SOLVE (29). While six Ta sites (corresponding to a single Ta 6 cluster) 
were obtained in both space groups 123 and I2 X 3, an interpretable map was 
obtained only in 12^. The resulting phases had a figure of merit of 0.68 after 
density modification procedures using RESOLVE (28). The resultant SAD 
phases were merged, improved, and extended for using the 1.8-A native data set 
using the programs CAD and DM as implemented in the CCP4 package, as¬ 
suming 1 monomer in the asymmetric unit with a Matthews coefficient of 3.4 and 
solvent content of 63.7% (7). Automated model building using Arp/wARP (15) 
traced —85% of the backbone and docked 65% of the sequence, including the 
side chains. The rest of the sequence was manually built into the density using 
“O” (13) and refined against the high-resolution native data to 1.8 A with 
iterative rounds of model building and refinement using Refmac5 (18) of CCP4 
(6). A summary of data collection and refinement statistics is shown in Table 1. 
The stereochemical quality of the final refined model was checked using Pro¬ 
check (16), and ribbon diagrams were made using Pymol (8). The coordinates 
and the structure factors have been deposited in the Protein Data Bank (PDB). 

Electrophoretic mobility shift assay. Gel mobility shift assays were used to 
measure the affinity of nsplO binding to nucleic acid. RNA and DNA oligomers 
designed to generate double-stranded hairpins, dimers, and concatemer confor¬ 
mations (5 '-GTTAAGGCCTTAAGGCCTTTAAAGGCCTTAAGGCCTTAA 
C-3') were synthesized commercially. RNA oligonucleotides were heated to 65°C 
for 5 min and cooled slowly to room temperature, while DNA oligonucleotides 
were boiled for 5 min and cooled similarly to generate double-stranded nucleic 
acid. Single-stranded RNA (5'-AUAUGGAAGAGCCCUA-3') was also synthe- 


TABLE 1. Data collection and refinement statistics^ 


Statistical element* 

Crystal type 

TaBr (infection XT .. 

wavelength) NatlVe 

Data collection statistics 

Space group 

12,3 

12,3 

Cell dimensions 

a = b = c = 

a = b = c = 

Wavelength (A) 

108.302 A 

108.874 A 

1.2553 

1.0000 

Resolution range (A) 

50.0-2.1 

50.0-1.8 

No. of observations 

252, 264 

316, 394 

No. of unique reflections 

1,2407 

19,035 

Completeness (%) 

99.8 (99.3) 

99.97(100.0) 

Redundancy 

9.2 

5.7 

Mean //ct(I) 

12.85 (2.5) 

22.96 (2.36) 

R sym a on I 

Highest resolution shell (A) 

0.110 (0.685) 

0.048(0.600) 

2.18-2.10 

1.86-1.80 

Refinement statistics 

R 

0.196 (0.27) 


^free 

0.233 (0.298) 


Protein atoms (avg. B factor) 

961 (24.8) 


Solvent atoms (avg. B factor) 

188 (38.5) 


Hetero atoms (avg. B factor) 

8 (37.8 ) 


RMSD^ bond length 

0.014 A 


RMSD bond angle 

1.49° 


Stereochemical validation 

Most favored (non-Gly, 

93 (86.9%) 


non-Pro) 

Generously allowed 

14 (13.1%) 


Additionally allowed 

0 


Disallowed 

0 



° Rsym = Wft- Ij ~ </> )/%j Ijl 

b -^work = ^hkl — FyShk! F 0 , where F 0 and F c are the observed and calcu¬ 
lated structure factors, respectively. 

c Five percent (1,023 reflections) of randomly chosen reflections was used in 
the calculation of R^^. 

d Values in parentheses are for data corresponding to the outermost shell. 

* For the phasing step, the following values were obtained: figure of merit 
SOLVE, 0.36; RESOLVE, 0.68; DM, 0.76. 
f RMSD, root mean square deviation. 


sized commercially. Nucleic acid substrates were resuspended at 1 mg/ml in 
water containing 0.1% sodium dodecyl sulfate to prevent RNase degradation. 
Binding mixtures of 5 |xg nucleic acid (80 to 100 pmol) were incubated with 
different concentrations of nsplO at 37°C for 1 h. Binding mixtures were sepa¬ 
rated by native polyacrylamide gel analysis on precast 6% acrylamide DNA 
retardation gels (Invitrogen). Nucleic acid was detected by staining with SYBR- 
gold stain (Molecular Probes), which is suitable for dsDNA, dsRNA, and ssRNA 
detection. SYBR-gold fluorescence was imaged on a UV light source equipped 
with a digital camera. Protein was then detected by SYPRO-ruby poststain 
according to the manufacturer’s protocol (Molecular Probes). 

Protein structure accession number. The coordinates and the structure factors 
have been deposited in the PDB under accession number 2FYG. 

RESULTS 

Structure determination by Ta SAD phasing. Crystals of 
SARS-CoV nsplO soaked in Ta 6 Br 12 2+ were used to obtain 
phases for structure determination. Characteristic of tantalum 
bromide derivatization (1), large dispersive anomalous differ¬ 
ences were observed in the low-resolution shells (>5.0 A), 
which diminished from 4.5 to 3.0 A and then increased again 
from 3.0 to 2.1 A in a highly redundant single-wavelength 
anomalous data set collected at the L-III absorption edge of 
Ta. Five of the six sites had reasonable occupancies (0.2 to 0.8) 
and B (temperature) factors (0.67, 0.63, 0.24, 0.22, 0.21). The 
Ta 6 Br 12 2+ cluster was found in the interstitial bulk solvent 
space near one of the threefold axes. This region was encased 
by six protein monomers that formed two distinct types of 
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FIG. 2. Structure of SARS-CoV nsplO. (a) Ribbon diagram of SARS-CoV nsplO showing the arrangement of helices and strands. The 
secondary structures are colored from blue (N terminus) to red (C terminus) and are numbered from HI to H5 for helices and 1 to 5 for the 
(3-strands, (b) Topology diagram showing the connectivities between the secondary structural elements in the nsplO structure. Helices are in cyan 
and strands are in yellow, with the same numbering scheme as that described for panel a. (c) Electron density observed at the first Zn 2+ -binding 
site. The residues coordinating the Zn 2+ ion are shown as balls and sticks, (d) Electron density observed at the second Zn 2+ -binding site. The four 
cysteine residues coordinating the metal ion at the second Zn 2+ ion near the protein C terminus are shown as balls and sticks. The 2 F a -F c maps 
are contoured at 1.0 <r, where F a and F c are the observed and calculated structure factors, respectively. 


trimers. The N-terminal His tag residues from one trimer and 
two polar loops from the other trimer aided in encasing the 
cluster. The final model (Fig. 2a) contains amino acids 10 to 
132 of SARS-CoV nsplO, along with five residues at the N 
terminus contributed by the purification tag (6X His tag) of the 
construct. We did not see density for five residues at the C 
terminus. The final R and R (rec values are 19.5 and 23.0%, 
respectively. Data collection and refinement statistics as well as 
stereochemical parameters are listed in Table 1. 

nsplO adopts a new fold. The nsplO monomer seen in the 
asymmetric unit has a single domain that can be roughly seg¬ 
regated into three regions: a largely helical subdomain at the N 
terminus, an irregular (3-sheet region, and a C-terminal region 
consisting mostly of loops (Fig. 1). The helical region encom¬ 
passes two antiparallel helices at the N terminus (HI and H2) 
along with three other small helices, H3, H4, and H5. This is 
followed by a large loop that connects the helical hairpin to an 
irregularly curved (3-sheet that resembles an imperfect (3-barrel 
with strands in the order 2, 3, 1, 4, 5. The hydrogen bonding 


pattern is broken at multiple regions between the strands of 
the barrel, and the first [3-strand is considerably bent compared 
to the other four. The middle of the polypeptide consists of 
three small helices and a large loop insertion. The C-terminal 
region of the polypeptide (residues 112 to 131) is largely un¬ 
structured, with irregular loops. A schematic of the topology of 
the nsplO fold is shown in Fig. 2b. 

We observed clear electron densities suggesting two bound ions 
in the structure (Fig. 2c and d). An X-ray absorption spectroscopy 
experiment yielded a strong peak at 9,658 eV corresponding to 
the K edge of Zn (data not shown), indicating that nsplO contains 
Zn. That both electron densities correspond to Zn 2+ ions was 
confirmed by analyzing a Zn anomalous difference map (data not 
shown). We therefore modeled two Zn 2+ ions into the electron 
densities; when refined with full occupancies, these have B factors 
(21.24 and 36.42) that are comparable to those of the rest of the 
protein molecule (24.8). 

Structure comparison using the entire polypeptide chain 
against known structures in the Protein Data Bank (PDB) (3) 
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nspIO WT 



modeled mutation 


(C) 

FIG. 3. Analysis of electrostatic charge distribution and symmetry mates of nspIO. (a) The surface of one of the nspIO monomers is shown with 
electrostatic potential colored from blue (positive) to red (negative) in the range of +4.1 kT to —4.1 kT. Important residues that contribute to the 
positive charge are shown as balls and sticks over a semitransparent protein surface. The two metal ions are shown as spheres, (b) Two 
symmetry-related dimers observed in the crystal. The two zinc atoms Znl and Zn2 are shown, (c) Modeled effect of a Q65E mutation on the surface 
charge of nspIO. A large contiguous negatively charged surface patch is generated which may unfavorably alter a critical binding interface with an 
interacting protein and/or RNA. This mutation causes a temperature-sensitive defect in minus-strand RNA synthesis in MHV (25). WT, wild type. 


by DALI (11) did not yield any hits with significant Z scores 
(>2.0). DALI searches with the individual subdomains (with 
five helices as one and the (3-sheet as the other) also did not 
yield any structural homologues, and neither did various com¬ 
binations of loop inclusions/deletions with these two domains, 
indicating that nspIO adopts a novel fold. Sequence homo¬ 
logues of SARS-CoV nspIO are found only in related corona- 
viruses (Fig. 1) and are notably absent among prokaryotes or 
eukaryotes. None of these homologues have been structurally 
characterized; the SARS-CoV nspIO structure is the first struc¬ 
tural representative of this family of proteins. 

nspIO is a Zn finger protein. The two Zn fingers present in 
the structure of SARS-CoV nspIO were compared with known 
Zn 2+ -binding motifs (14). Several unique conformational fea¬ 
tures are seen in both Zn fingers. The first Zn finger is present in 
the middle of the polypeptide chain, where the metal ion is co¬ 
ordinated by three cysteines and a histidine residue (C74, C77, 
H83, and C90; Fig. 2c). Of these, C77 lies in a short, single-turn 


a-helix (H4), while the other three are in loops. The conforma¬ 
tion of protein residues in the vicinity of the Zn 2+ ion are such 
that this site cannot be placed in any of the known classes of 
Zn 2+ -binding sites (14). The only group in which it can be placed 
is the loosely defined category of “Zn 2+ -binding loops” (14). The 
second Zn 2+ finger is coordinated by four cysteine residues at the 
C-terminal tail of the protein (C117, C120, C128, and C130; Fig. 
2d). Conformational comparison of the protein backbone as well 
as the side chains of the four cysteines in the vicinity of this Zn 2+ 
finger suggests this motif is a member of the gag-knuckle fold 
group (14). Specifically, it belongs to the polymerase gag-knuckle 
family, members of which include one of the Zn fingers of the 
large subunit of yeast RNA polymerase (residues 63 to 82 of PDB 
no. 1I3Q) and the dsRNA-binding protein sigma3 of the reovirus 
outer capsid (residues 48 to 77 of PDB no. 1FN9). The primary 
function of a majority of Zn fingers of this family is to bind single- 
and/or double-stranded RNA; however, it may also be a protein- 
protein interaction module (14). Surface charge distribution in- 
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dsRNA dsDNA ssRNA Protein Only 

FIG. 4. Nucleic acid binding by nsplO. Binding mixtures containing (A) 80 pmol double-stranded RNA, (B) 82 pmol double-stranded DNA, 
(C) 100 pmol single-stranded RNA, or (D) no nucleic acid were incubated with various concentrations of nsplO. Binding mixtures shown in lanes 
0 to 7 contained 0, 10, 20, 40, 80, 160, 320, and 800 p,M nsplO, respectively. Free and bound nucleic acids were detected by SYBR-gold staining 
(Invitrogen). A double-stranded DNA ladder (1 Kb Plus; Invitrogen) was included as a marker (lane M). The positions of free nucleic acid (F), 
shifted nucleic acid-nsplO complexes (S), and supershifted multiplexes (O) on the gels are indicated at the left. The shifted and supershifted nucleic 
acid peaks coincide with SYPRO-ruby protein stain in each gel (not shown). 


dicates that this Zn finger lies in a negatively charged region that 
is clustered with Asp and Glu residues (Fig. 3b). It is noteworthy 
that while many of the residues near these two metal ions are 
exposed to the solvent, the Zn 2+ ions are themselves buried. 

A distinct positively charged patch is also noticeable on one 
side of the protein surface (Fig. 3a). Extending to almost 30 A 
along one direction, this positive patch encompasses the first 
Zn finger (Zn atom labeled Znl in Fig. 3a) along with several 
residues that are conserved across the different viral nsplOs. A 
few important residues that contribute to this positive charge 
include Lys93, Arg78, Serll, and Asn40. Such a large positive 
surface coupled with the observation of two Zn fingers points 
to the possibility of an RNA-binding function for this protein. 
The charge distribution as seen in two different dimers com¬ 
posed of symmetry-related monomers as observed from crystal 
packing is shown in Fig. 3b. The second Zn finger (Zn atom 
labeled Zn2) appears to play a critical role in both structural 
integrity of the C-terminal half of the structure as well as its 
oligomerization, as evidenced by the close proximity and its 
interaction in the two symmetry-related molecules (Fig. 3b). 
Analytical size exclusion chromatography indicates that nsplO 
is a dimer in solution (data not shown). The high symmetry of 
the I-centered cubic lattice makes it difficult to predict which of 
the several symmetry-related oligomers is physiologically rele¬ 
vant. Two of these are shown in Fig. 3b. 

Several proteins in the SARS-CoV proteome possess one or 
more Zn 2+ -binding motifs: nsplO, the papain-like protease 
domain of nsp3, the N-terminal domain of nspl3 (helicase/ 
NTPase), and nspl4 (exonuclease) (26). nsp2 and the N-ter¬ 
minal domain of nspl2 (RNA-dependent RNA polymerase) 
also contain cysteine-rich regions that may harbor Zn fingers. 
Zn fingers are found in proteins that perform diverse functions 
in various cellular processes: replication, repair, transcription, 
translation, metabolism, signaling, cell proliferation, and apop¬ 
tosis (see reference 14 for a review). They function as interac¬ 
tion modules and bind to a wide variety of ligands including 
proteins and small molecules, but most often they bind nucleic 
acids. 

Physiological role of nsplO. nsplO is found in all coronavi- 
ruses, and its sequence is highly conserved (between 51 and 


56% identity) across the entire length of the protein (Fig. 1), 
suggesting its importance in the coronavirus life cycle. In 
MHV, there is evidence that nsplO plays a critical role in the 
replicase complex: a temperature-sensitive Gln65Glu mutation 
caused a defect in minus-strand RNA synthesis at the nonper- 
missive temperature of 40°C, while plus-strand synthesis ap¬ 
peared to be unaffected (25). Gln65 is absolutely conserved in 
all coronavirus nsplO proteins. When modeled on the SARS- 
CoV nsplO structure, mutation of Gln65 to a Glu residue 
generates a large contiguous negatively charged surface patch 
(Fig. 3c) which may unfavorably alter a critical binding inter¬ 
face with an interacting protein and/or RNA. Immunofluores¬ 
cence studies using confocal microscopy in MHV-infected cells 
have demonstrated that pl5 (MHV counterpart of SARS-CoV 
nsplO) colocalizes with p28, p65, 3CL protease, plO, p22, pl2, 
the RNA-dependent RNA polymerase, the helicase, and the 
nucleocapsid protein (counterparts of SARS-CoV nspl, nsp2, 
nsp5, nsp7, nsp8, nsp9, nspl2, nspl3, and sars9a, respectively) 
at the site of replication complexes (4, 5). Further, specific 
interactions between MHV pl5, p28, and plO were seen in 
yeast two-hybrid and coimmunoprecipitation experiments, sug¬ 
gesting a direct role of nsplO in the replicase complex (5). 

In the pplab polyprotein, SARS-CoV nsplO is preceded by 
three proteins demonstrated to interact with RNA (nsp7, nsp8, 
and nsp9) (9, 27, 31) and is followed by the suite of putative 
viral RNA-modifying enzymes (nspl2 to nspl6) (26). The 
proximity of nsplO to viral RNA-binding and RNA-modifying 
proteins, the presence of two coordinated Zn 2+ ions in the 
crystal structure, and the presence of the highly conserved 
positively charged face (Fig. 3a) suggested that nsplO could 
play a role in viral RNA synthesis by binding directly to RNA. 
The affinity of nsplO for nucleic acid was estimated by electro¬ 
phoretic mobility shift assays (Fig. 4). nsplO displayed only 
micromolar affinity for single-stranded and double-stranded 
nucleic acids and did not exhibit a preference for RNA over 
DNA. Nucleic acid binding was not diminished by the presence 
of up to 250 mM sodium chloride in the binding buffer. The 
nsp9 RNA binding protein (9) also exhibited micromolar af¬ 
finities for nucleic acid with low substrate preference, charac¬ 
teristic of nonspecific binding. While the affinities presented 
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here for nsplO and previously for nsp9 would generally be 
considered insufficient to describe specific RNA-binding pro¬ 
teins, we are unable to rule out the possibility that nsplO 
contributes to a larger RNA-binding protein complex. 

nsplO also appears to contribute significantly to the cyto- 
pathogenic effects observed in SARS infection in lung tissue by 
interacting with the oxidoreductase system in mitochondria 
(17). Yeast two-hybrid experiments screening a human embryo 
lung cDNA library for interactions, using SARS-CoV nsplO as 
bait, revealed two subunits of cellular RNA polymerase B 
complex (BTF3 and ATF5), the NADH 4L subunit, and cyto¬ 
chrome oxidase II as hits (17). Specific interaction with cyto¬ 
chrome oxidase II was supported by a glutathione 5-transferase 
pull-down assay and Western blotting (17). Interestingly, tran¬ 
sient expression of SARS-CoV nsplO in KMB-17 cells inhibited 
replication of human coronavirus (HCoV) 229E (17), perhaps by 
competitively interfering with the HCoV replicase complex. 

DISCUSSION 

The SARS-CoV proteome contains a number of proteins 
with very low sequence similarity to other proteins. Even at the 
structural level, SARS-CoV proteins are proving to yield a 
significant number of new folds. To date, out of the nine 
proteins with available three-dimensional structural informa¬ 
tion (the ADP ribose 1-phosphatase [ADRP] domain of nsp3, 
nsp5, nsp7, nsp8, nsp9, nsplO, sars2, and sars7a and the N- 
terminal domain of sars9a), five contain new folds. Both in 
isolation (20) and within the nsp7-nsp8 hexadecamer super¬ 
complex (31), nsp7 adopts a novel fold consisting of four and 
three helices, respectively. In the supercomplex, nsp8 also con¬ 
sists of a new fold (31). It occurs in two distinctly different 
conformations: a “golf club”-like structure (composed of an 
N-terminal “shaft” domain and a C-terminal “head” domain) 
and a “bent golf club” conformation, with the long helical shaft 
bent in the middle into two shorter helices linked by a coil (31). 
Certain features of the nsp9 fold were found to bear some 
similarity to four different existing folds (trypsin-like proteases, 
the C-terminal domain of pi-transposase, a- and [3-subunits of 
Fl-ATP synthase-like, and small protein B) (9, 27). However, 
there are sufficient differences for it to be defined as a unique 
nucleic acid-binding fold. Finally, the nuclear magnetic reso¬ 
nance structure of the N-terminal domain of sars9a (nucleo- 
capsid protein) revealed a five-stranded |3-sheet topology not 
found in other RNA-binding proteins (12). As more structures 
of the SARS-CoV proteome and those of other viruses are 
structurally characterized, it will be interesting to see if the 
preponderance of uncommon folds is unique to coronaviruses 
or true of viruses in general. 

Conclusions. This study reports the first structural charac¬ 
terization of SARS-CoV nsplO, a protein critical to viral rep¬ 
lication that is highly conserved in coronaviruses. It plays a vital 
but as-yet uncharacterized role in negative-strand RNA syn¬ 
thesis by the replicase complex (25). nsplO adopts a tertiary 
fold that, to our knowledge, has not been previously observed. 
It contains two Zn fingers, the first of which belongs to a new 
Zn finger family. The presence of the two Zn fingers, a large 
conserved positively charged patch on the surface of the pro¬ 
tein, as well as its physical location within a cluster of RNA- 
binding/RNA-modifying proteins in the replicase polyprotein 


all seem to suggest an RNA-binding function. While the gel- 
shift assays reported here confirm the ability of purified nsplO 
to bind RNA, the observed interaction is weak and not se¬ 
quence specific. Previous immunolocalization and interaction 
studies have indicated that in vivo, nsplO acts in concert with 
numerous other viral proteins, counterparts of SARS-CoV 
nspl, nsp2, nsp5, nsp7, nsp8, nsp9, nspl2, nspl3, and sars9a (4, 
5). It is in the context of these other components of the rep¬ 
licase complex that the physiological role of nsplO is likely to 
become clearer. While speculative, it is highly likely that nsp9 
and nsplO might in fact bind to the hexameric nsp7-nsp8 su¬ 
percomplex as part of a larger structure that directs double- 
stranded RNA through its central channel towards the RNA 
polymerase for genome replication. While still in their infancy, 
experiments designed to study replicase assembly and function 
in SARS-CoV and related coronaviruses hold promise to elu¬ 
cidate this complex process in the viral life cycle. 
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